Recognising and Interpreting Named Temporal Expressions
نویسندگان
چکیده
This paper introduces a new class of temporal expression – named temporal expressions – and methods for recognising and interpreting its members. The commonest temporal expressions typically contain date and time words, like April or hours. Research into recognising and interpreting these typical expressions is mature in many languages. However, there is a class of expressions that are less typical, very varied, and difficult to automatically interpret. These indicate dates and times, but are harder to detect because they often do not contain time words and are not used frequently enough to appear in conventional temporally-annotated corpora – for example Michaelmas or Vasant Panchami. Using Wikipedia and linked data, we automatically construct a resource of English named temporal expressions, and use it to extract training examples from a large corpus. These examples are then used to train and evaluate a named temporal expression recogniser. We also introduce and evaluate rules for automatically interpreting these expressions, and we observe that use of the rules improves temporal annotation performance over existing corpora.
منابع مشابه
TIMEN: An Open Temporal Expression Normalisation Resource
Temporal expressions are words or phrases that describe a point, duration or recurrence in time. Automatically annotating these expressions is a research goal of increasing interest. Recognising them can be achieved with supervised machine learning, but interpreting them accurately (normalisation) is a complex task requiring human knowledge. In this paper, we present TIMEN, a community-driven t...
متن کاملA Cascaded Machine Learning Approach to Interpreting Temporal Expressions
A new architecture for identifying and interpreting temporal expressions is introduced, in which the large set of complex hand-crafted rules standard in systems for this task is replaced by a series of machine learned classifiers and a much smaller set of context-independent semantic composition rules. Experiments with the TERN 2004 data set demonstrate that overall system performance is compar...
متن کاملA Greek Named-Entity Recognizer That Uses Support Vector Machines and Active Learning
Wepresent a named-entity recognizer for Greek person names and temporal expressions. For temporal expressions, it relies on semiautomatically produced patterns. For person names, it employs two Support Vector Machines, that scan the input text in two passes, and active learning, which reduces the human annotation effort during training.
متن کاملNamed Entity Recognition in Greek Texts with an Ensemble of SVMs and Active Learning
We present a freely available named-entity recognizer for Greek texts that identifies temporal expressions, person, and organization names. For temporal expressions, it relies on semi-automatically produced patterns. For person and organization names, it employs an ensemble of Support Vector Machines that scan the input text in two passes. The ensemble is trained using active learning, whereby ...
متن کاملThe Multilingual Entity Task a Descriptive Analysis of Enamex in Spanish
1. Introduction. The task involved identifying and typing all named entity expressions (ENAMEX), numerical entity expressions (NUMEX), and temporal entity expressions (TIMEX) in Spanish news articles. The analysis of the data suggests that focusing on the high frequency expressions results in a higher payoff. This report looks primarily at ENAMEX expressions because they accounted for nearly th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013